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Abstract. In this work, the possible dynamics associated with leptophilic Zi boson 
at CLIC (Compact Linear Collider) have been investigated by using artificial neural 
networks (ANNs). These hypotetic massive boson Zi have been shown through the 
process e + e~ — > fi + [i~ . Furthermore, the invariant mass distributions for final muons 
have been consistently predicted by using ANN. For these highly non-linear data, we 
have constructed consistent empirical physical formulas (EPFs) by appropriate feed- 
forward ANN. These ANN-EPFs can be used to derive further physical functions which 
could be relevant to studying Zi. 



PACS numbers: 07.05.Mh, 14.80.-j 



A Search for Leptophilic Vector Boson Z[ at CLIC by Using Neural Networks 



2 



1. Introduction 

The gauging of the baryon and lepton numbers has a long history. In 1955 Lee and 
Yang proposed massless baryonic photon PQ, later in 1969 Okun considered massless 
leptonic photon [2] in analogy with the baryonic photon. On the other hand gauging of 
B-L [31 E] is natural in the framework of Grand Unification Theories. Manifestations of 
the Z boson of the minimal B-L model at future linear colliders and LHC have been 
considered in recent paper [5]. In [6] we have considered phenomenology of massive U(l) 
boson coupled to lepton charge. 

The physical phenomena involved in massive leptonic (leptophilic) boson Z\ are 
characteristically highly nonlinear. Therefore, it may be difficult to construct explicit 
form of empirical physical formulas (EPFs) relevant to Z[. Then, by various appropriate 
operations of mathematical analysis, derivation of potentially useful highly non-linear 
physical functions for Z[ is of utmost interest. Compatibly a previous theoretical 
treatment [7], appropriate EPFs relevant to Z\ can be built by using a feed-forward 
artificial neural network (ANN). As we will give more details in Section 2, the ANN is 
an universal non-linear function approximator [8]. 

Recently, ANNs have emerged with successful applications in many fields, including 
Higgs boson search [9j [TUJ, [HI [121 EE] ■ I n this study, compatible with our previous study 
[6], mass (M^J and interaction constant (gi) of Z\ at CLIC (Compact Linear Collider) 
with ^/s = 3TeV hane been investigated and obtained some limit values for Mz l and g\ 
by using ANNs. In all calculations we have performed, signal and background processes 
were e + e~ — > 7, Z, Z\ — > and e + e~ — > 7, Z — > /i + yU~ , respectively. Besides, the 

invariant mass distributions of final muons (M^+^J) for signal and background processes 
have been consistently acquired by using ANN. For numerical calculations, we have 
used the CALCHEP Simulation Program [TJ] and drived the data from this program 
as the ANN train and test set data. Also, we particularly aim to construct explicit 
mathematical functional form of ANN-EPFs for non-linear data relevant to Zi. While 
the calculated data were intrinsically highly non-linear, even so train set ANN-EPFs 
successfully fitted these data. Moreover, test set ANN-EPFs consistently predicted the 
data. That is, the physical laws embedded in the data were extracted by the ANN-EPFs. 

2. ANN and ANN-EPF 

The fundamental task of the artificial neural networks (ANNs) is to give outputs in 
consequence of the computation of the inputs. ANNs are mathematical models that 
mimic the human brain. They consist of several processing units called neurons which 
have adaptive synaptic weights [15]. The ANN consists of three layers named as input, 
hidden and output figure [T] The number of hidden layers can differ, but a single hidden 
layer is enough for efficient non-linear function approximation [8]. In this study, one 
input layer with one neuron, one hidden layer with many (h) neuron and one output layer 
with one neuron (1-h-l) ANN topology was used for investigation of Z\ at CLIC with 
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y/s = 3TeV. Analyses have been performed for different h numbers, but results have 
been generally given for most convenient h numbers. The total numbers of adjustable 
weights could be calculated by using formula given as 

pxh + hxr = hx(p + r) = 2h (1) 

where p and r are the input and output neuron numbers, respectively. 



Hidden 
Layer 




Figure 1. Fully connected one input-one hidden-one output layer ANN. and yi 
are input and output vector components, respectively. Circles are neurons and arrows 
indicate adaptable synaptic weights. Wj k : weight vector component, where i is a layer 
index, jk weight component from the jth neuron of ith layer and to kth neuron of 
(i+1 )th layer. 



The neuron in the input layer collects the data from outside and transmits 
via weighted connections to the neurons of hidden layer needed to approximate any 
nonlinear function. The hidden layer neuron activation function can be any well-behaved 
non-linear function. We have choosen the type of activation function for hidden layer 
as hyperbolic tangent equation [2] Note that instead of this function, any other suitable 
sigmoidal function could also be used. Finally, the output layer neuron returns the 
signal after the analysis. 

tanh = (2) 

e x + e -x w 

An ANN software NeuroSolutions v6.02 has been used for separate applications. 
In these applications, ANN inputs are M Zl , g\ and M M + M _ and corresponding desired 
outputs are cross sections for each input. For all ANN processing case, we have divided 
the data into two equal separate sets. One of these sets belongs to the training stage 
and the rest to the test stage. In the training stage, a back-propagation algorithm 
with Levenberg-Marquardt has been used for the training of the ANN. The maximum 
epoch number (one complete presentation of the all input-output data to the network 
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being trained) is 1000. ANN modifies its weights by appropriate modifications until 
an acceptable error level between predicted and desired outputs is reached. The error 
function measures the difference between outputs is mean square error (MSE). It is 
calculated as 

MgJTJ — Efc=l ^2i=l(yki ~ fki) ] ^ 

where N is the number of training or test samples, y^ and fki are the desired output and 
network output, respectively. Then by using ANN with final weights, the performance 
of the network is tested over the test set data never seen before by network. If 
the predictions of the test set data are well enough, the ANN is considered to have 
consistently learned the functional relationship between input and output. For our 
training stage, the minimum and maximum MSE values are 8 x 10~ 38 and 5 x 10 -3 
respectively, for our test stage 6 x 10~ n and 4 x 10~ 2 . In figure [2j the training MSE 
values for invariant mass distribution of final muons at CLIC with y/s = 3TeV for 
Mz, = ITeV have been shown as an illustration. 
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Figure 2. For invariant mass distribution of final muons at CLIC, the training MSE 
values versus epoch number. 



Owing to the fact that a single hidden layer feed-forward ANN is enough for non- 
linear function approximation [8] , in this paper we have used single hidden layer ANNs. 
We have only explained the single hidden layer feed-forward ANN functionality. From 
[8], the desired output vector y is approximated by a network multi-output vector / 
which is defined as 

h 

f:R p ^R r : f k {x) = PjGiAjix)); xeRP, fyeR, A^A V ', k = 1, r(4) 

3=1 

where x is the ANN input vector, A p is the set of all functions of R p — > R defined by 
A(x) = w ■ x + b, w is input to hidden layer weight vector, b is the bias weight and G 
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is hidden layer activation function. In figure [Tj the columns of the weight matrices w 1 
and w 2 correspond to weight vectors defined in A(x) and /3 in equation [4 However, as 
can be seen in figure l]and equation^ the correspondences w 1 — > A(x) and w 2 — > (3 are 
valid only for single hidden layer feed-forward ANN. 

Since a deterministic or random EPF is usually a mathematical vector function 
(y : BP — > R r ) between the physical variables under investigation, feed-forward ANN is 
relevant to EPF construction. The ANN defined by equation [4] is especially relevant in 
this context as a general input-output function estimator. Although there can be several 
independent variables (p > 1 in equation [5]) , dependent variable number is usually one 
(r = 1 in equation [4]). Training set data for both independent and dependent physical 
variables are presented to the input and output layers respectively. Then after an 
appropriate weight adaptation process, the ANN estimates the unknown generally non- 
linear EPF. 

3. The concrete algorithm for ANN-EPF construction 

In order to construct appropriate EPFs for highly non-linear cross sections of Mz n gi 
and M u + U _ we have used one neuron output ANN vector function / in equation 4 
However, it gives the rough structure of the ANN without generating the final EPF 
parameters/final ANN optimal weights. Moreover, this equation is not sufficient for the 
complete construction of the desired non-linear EPF. Obtaining the final weight vector 
(wf) and the corresponding ANN output vector function f min = f(ulf) of equation 4 
we have simultaneously used the equations [3] and [4J f min is the network output vector 
function giving the minimum MSE by a convenient ANN weight adaptation. Note 
that, f m i n is the best non- linear estimation vector of the theoretically unknown desired 
output function y : BP — >■ R r . In other words, the unknown vector function y desired 
non-linear EPF is estimated by f m in depending on the structure of the network output 
vector function / and the final weight vector wj. In equation 4, / depends on the 
explicit forms of A and G functions and the components of the weight are embedded 
in A(x) and (3. By setting f3 = w 2 and A(x) = w 1 ■ x + b and G: tangent hyperbolic, 
we can construct explicit form of f and by minimizing of equation [3J we have finally 
obtained f min = f(wf). Eventually, the concrete ANN-EPF construction algorithms for 
non-linear cross sections are completed. By using these algorithms, the obtained actual 
ANN-EPFs results are given in Section 4. 

4. Results and discussion 

Note that, the ANN inputs and outputs used in this paper have been explicitly defined 
in equation |3J In the text where it suitably applies, the cross section term has been 
used for the ANN outputs related to ANN inputs (M Zl , gi and M M + At _). 
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4-1. ANN-EPFs for train set fittings 

During all the training stages, the number of data points was 50% of all data. For a 
single hidden layer ANN, the train set nno (neural network output) fittings for cross 
sections versus Mz l for different gi values (0.10, 0.20, and 0.30) were given in figure 
[3j Here, the best fitting was obtained for h=3 (h: hidden layer neuron number). In 
figures [4] and [5] the train set nno fittings for cross sections versus gi and M^+^_ were 
given, respectively. For M M +^_ two different M Zl values have been used and hidden 
layer neuron number which gives the best fitting is 7. In order to show effect of varying 
h, not only best one but also different ones of h (h=4, 7, 20) have been given for gi in 
figure |4| The nno fittings for each h values would have been densely nested inside each 
other. It can be clearly seen in all figures, the nno fittings agree exceptionally well with 
highly non-linear calculated (calc) data. Additionally, one can see that signals are well 
above the background in figures [3] and [5j 
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Figure 3. Calculated and nno training set fittings of cross section versus Mz l for 
different gi values at CLIC with ^fs = 3TeV. 

4-2. Consistency of the constructed ANN-EPFs: Test set predictions 

Unless the train set ANN-EPFs are tested over test set data, these fitted EPFs cannot 
be used consistently over a desired range of cross section values. If the predictions are 
consistent with the test set data values, then the ANNs can be taken as appropriate 
ANN-EPFs. 

The corresponding test set nno predictions of figures [3| [4] and [5] have been seen in 
figures |6j [7] and |8j respectively. The number of data points was 50% of all data. As can 
be seen in figures [6j [7] and |8j the nno predictions agree exceptionally well with highly 
non-linear experimental values. This obviously indicate that the test set ANNs of cross 
sections versus Mz l , gi and M /i + /J _ have consistently generalized the train ANN fittings. 
The physical law embedded in cross sections versus Mz n gi and M M + M _ data has been 
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Figure 4. Calculated and nno training set fittings of cross section versus gi for different 
h values at CLIC with </s = 3TeV. 
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Figure 5. Calculated and nno training set fittings of cross section versus M^+u— for 
SM signal and background at CLIC with ^/s — 3TeV. 

successfully extracted by the constructed ANN, so that the test set ANNs can be safely 
used as ANN-EPFs. It can be clearly seen that signals are well above the background 
in figures [6] and [8j 

5. Conclusions 

Future linear colliders, like CLIC, will give possibility for investigation leptophilic vector 
boson (Zi) with masses up to the center of mass energy if g\ > 1(C 3 . In this paper, as 
an alternative approach, we obtained cross sections for Mz n gi and M^+^_ via artificial 
neural networks (ANNs). We have investigated that the ANN method is consistent 
with our calculations from the CALCHEP simulation program. For highly non-linear 
cross sections for Mz l , gi and M M +^_ , we have constructed consistent empirical physical 
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Figure 6. Calculated and nno test set predictions of cross section versus Mz, for 
different gi values at CLIC with y/s — 3TeV. 




Figure 7. Calculated and nno test set predictions of cross section versus gi for different 
h values at CLIC with ^/s = 3TeV. 
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Figure 8. Calculated and nno test set predictions of cross section versus M /J + M _ for 
SM signal and background at CLIC with y/s — 3TeV. 
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formula (EPFs) by appropriate ANNs. The test set ANNs of cross sections versus Mz v 
gi and M^+^j. have generalized the train ANN fittings. Therefore, the test set ANNs 
can be surely used as ANN-EPFs since the physical laws embedded in cross sections 
versus Mz t , gi and M M + M _ data have been successfully extracted by the ANN. 
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